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Time-series data with regular and/or seasonal long-memory are often aggregated before anal- 
ysis. Often, the aggregation scale is large enough to remove any short-memory components of 
the underlying process but too short to eliminate seasonal patterns of much longer periods. 
In this paper, we investigate the limiting correlation structure of aggregate time series within 
an intermediate asymptotic framework that attempts to capture the aforementioned sampling 
scheme. In particular, we study the autocorrelation structure and the spectral density function 
of aggregates from a discrete-time process. The underlying discrete-time process is assumed to 
be a stationary Seasonal AutoRegressive Fractionally Integrated Moving-Average (SARFIMA) 
process, after suitable number of differencing if necessary, and the seasonal periods of the un- 
derlying process are multiples of the aggregation size. We derive the limit of the normalized 
spectral density function of the aggregates, with increasing aggregation. The limiting aggregate 
(seasonal) long-memory model may then be useful for analyzing aggregate time-series data, 
which can be estimated by maximizing the Whittle likelihood. We prove that the maximum 
Whittle likelihood estimator (spectral maximum likelihood estimator) is consistent and asymp- 
totically normal, and study its finite-sample properties through simulation. The efficacy of the 
proposed approach is illustrated by a real-life internet traffic example. 

Keywords: asymptotic normality; consistency; seasonal auto-regressive fractionally integrated 
moving-average models; spectral density; spectral maximum likelihood estimator; Whittle 
likelihood 

1. Introduction 

Data are often aggregated before analysis, for example, 1-minute data aggregated into 
half-hourly data or daily data aggregated into monthly data. Aggregation of data may 
be carried out for ease of interpretation on a scale that is of interest, for example, policy 
makers and/or the public are more interested in monthly unemployment rate than daily 
unemployment rate. On the other hand, data may be naturally aggregated, for example, 
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tree-ring data, which are often hard to disaggregate. On a fine sampling scale, many 
time series are of long memory in the sense that their spectral density functions admit 
a pole at the zero frequency. A popular class of discrete time long memory processes are 
autoregressive fractionally integrated moving average (ARFIMA) models (see Granger 
and Joyeux [6], Hosking [7]). Man and Tiao [11] and Tsai and Chan [18] showed that 
temporal aggregation preserves the long-memory parameter of the underlying ARFIMA 
process. Ohanissian, Russell and Tsay [13] made use of this property in developing a 
test for long-memory. Furthermore, as the extent of aggregation increases to infinity, the 
limiting model retains the long-memory parameter of the original process, whereas the 
short-memory components vanish. 

In practice, the underlying process may admit seasonal long memory in that its spec- 
tral density function may have poles at certain non-zero frequencies. Such data may 
be modeled as some Seasonal Auto-Regressive Fractionally Integrated Moving-Average 
(SARFIMA) process, see Section 2. If the aggregation interval is much larger than the 
largest seasonal period, aggregation will intuitively merge the seasonal long-memory com- 
ponents with the regular long-memory component and eliminate the regular or seasonal 
short-memory components of the raw data. For example, within the framework of ARIMA 
models, Wei [19] showed that aggregation removes seasonality if the frequency of aggre- 
gation is larger than or the same as the seasonal frequency. 

On the other hand, if the aggregation interval is large but is just some fraction of the 
seasonal periods of the original data, the aggregates may be expected to keep the seasonal 
short- and long-memory pattern, albeit with different periods. For many data, the latter 
scenario may be more relevant for analysis. For example, aggregating 1-minute data into 
half-hourly data may remove the short memory component on the minute scale but the 
daily or monthly correlation pattern of the raw data may persist in the aggregates. 

Here, our purposes are twofold. First, we study the intermediate asymptotics of aggre- 
gating a SARFIMA process. In particular, we derive the limiting (normalized) spectral 
density function of an aggregated SARFIMA process via the asymptotic framework where 
the seasonal periods of the SARFIMA model are multiples of the aggregation interval 
and the aggregation interval is large. While the original time series is assumed to be a 
SARFIMA process, the limiting result is robust to the exact form of the short-memory 
and the regular long-memory components. The limiting spectral density functions then 
define a class of models suitable for analyzing aggregate time series that may have regular 
or seasonal long-memory and short-memory components. Second, we derive the large- 
sample properties of the spectral maximum likelihood estimator of the limiting aggregate 
SARFIMA model, obtained by maximizing the Whittle likelihood. 

The rest of the paper is organized as follows. The SARFIMA model is reviewed in 
Section 2. In Section 3, we derive the limiting spectral density function of an aggregate 
SARFIMA process, under the intermediate asymptotic framework. Spectral maximum 
likelihood estimation of the limiting aggregate SARFIMA model and its large-sample 
properties are discussed in Section 4. We compare the empirical performance of the 
spectral maximum likelihood estimator of the limiting model with that of the SARFIMA 
model by Monte Carlo studies in Section 5. The simulation results suggest that fitting 
the limiting model to the aggregate data generally reduces the bias in some long-memory 
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parameters than simply fitting a SARFIMA model. We illustrate the use of the limiting 
aggregate SARFIMA model and its possible gains in long-term forecasts with a real 
application in Section 6. We conclude in Section 7. All proofs are collected in the appendix 
of Chan and Tsai [5] . 

2. Seasonal autoregressive fractionally integrated 
moving average models 

We now briefly review the SARFIMA model which is widely useful in scientific analysis; 
see Porter-Hudak [16], Ray [17], Montanari, Rosso and Taqqu [12], Palma and Chan [15], 
Bisognin and Lopes [2] and Lopes [10]. Let {Y t ,t = 0,±1,±2, . . .} be a seasonal autore- 
gressive fractionally integrated moving average (SARFIMA) model with multiple periods 

^1 j • ■ • ) &c 



(B)(l - B) d U$i(B s *)(l - B s *) D *Y t = 6(B)l[e i (B s *)e t , 



(1) 



i=l 



i=l 



where d and Di,i = l,...,c, are real numbers, s c > s c _i > • • • > s± > 1 are integers, 
{e*} is an uncorrelated sequence of random variables with zero mean and common, finite 



- 2 > 0, 



variance a c 
= 1 

shift operator, and (1 



1 



iz <P P z' , v{z) = L + Oiz-\ hfrgZ', and tor % = 1, . . . , c, 

®i,PiZ Pi , &i(z) = 1 + Qi.iz H h 0^ Qi z Qi , B is the backward 

B) d is defined by the binomial series expansion 



n - m d -V r(fc-d) 



B 



where T(-) is the gamma function. Stationarity of {Y t } requires < 1/2 for all i and 
(1 + ^2^=1 Di < 1/2, see Palma and Bondon [14]. We assume that none of the roots of cj>(-) 
and « = 1, • • ■ , c, match any roots of #(•) and Qi(-), i = 1, . . . , c. Moreover, all roots 

of of the above polynomials are assumed to lie outside the unit circle. The conditions on 
the roots, the fractional orders d and D's ensure that {Y t } is stationary and the model 
is identifiable. It can be readily checked that the spectral density of {Y t } equals, for 

— 7T < UJ < 7t, 



h(u) 



2n 



0(exp(iw)) 



0(exp(iw)) 



2 sin 
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x J ^J|(exp(ii/j-fc) - exp(iw))(exp(-ii/jfc) - exp(iw))| 

3=1 fe=l 



-25, 
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where So = d + D\ + ■ ■ ■ ■+ 
for j = 1 , . . . , c, and = 1, 
1, and 5j Tj = Dj/2 if Sj = 



-D c ; Tj = [sj/2], the greatest integer < Sj/2; i/jk = 2irk/sj 



5jk 



Dj , for k 



1, Si 



Dj if Sj 



2r 7 - 



2Tj. From (2), we see that, as uj — > 0, the spectral density 
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f(oj) = 0(\uj\~ 2d ~ 2Dl 2Dc ), whereas for j = 1, . . . , c, k = 1, . . . ,Tj, asw-> Vj k , /(w) = 
0(\to — Vjk\~ 2Dj ). Given our interest in long-memory processes, throughout this paper, 
the parameters d and the Dj's are restricted by the inequality constraints: < d + D 1 + 
■ ■ ■ + D c < 1/2, and < Dj < 1/2, for j = l,...,c. 



3. Aggregates of SARFIMA models 

For non-stationary data, we assume that, after suitable regular and/or seasonal differenc- 
ing, the data become stationary and follow some stationary SARFIMA model. Specifi- 
cally, let r and Rj, j = 1, . . . , c, be non-negative integers and {Y t , t = 0, ±1, ±2, . . .} a time 
series such that (1 - B) r (l - B Sl ) Rl ■ ■ ■ (1 - B^) R "Y t is a stationary SARFIMA model 
defined by equation (1). Therefore, {Y t } satisfies the difference equation 



4>{B) (1 - B) r+d Y[ $j(B s J)(l - IT " ) , = 6(B) Y[ Qj{B s ')e u 



(3) 



which is referred to as the SARFIMA(p, r + d, q) x (Pi, + D x , Qi) Sl x • • • x (P c , R c + 
D c ,Q c ) Sc model. 

Let m > 2 be an integer and A™ = 53]^m(T-x)+i ^ k ^ e the nonoverlapping m-temporal 
aggregates of {Y t }. Let V = 1 — B be the first difference operator, and V s = 1 — B s the lag- 
s difference operator. Let R = (r, R\, . . . , R c ), £ = (d; Dj, j = 1, . . . , c; = 1, • • • , c,j = 



P.O. 

_L , . . . , ± 2 , W z 



1, . . . , c, j = 1, . . . , Qi), and assume Si 



1, . . . , c, where the z^s 



are positive integers. Below we derive the spectral density of the aggregates, and the limit 
of the normalized spectral densities with increasing aggregation. The normalization that 
makes the spectral densities integrate to 1 is necessary because, without normalization, 
the variance of the aggregates generally increases to infinity with increasing aggregation. 



Theorem 1. Assume that {Y t } satisfies the difference equation defined by (3). 

(a) For r > 0, Ri > 0, i = 1, . . . ,c, and m = 2h + 1, the spectral density function of 
{V'V^ 1 ■ • • y^Xrp} is given by 



fi,R,m{u) 



2 sin 



h 
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k=-h 



2r+2 c 



n 



2 sin 



-2D, c 

n 



f lu + 2kn 

2 sin 

V 2m 



-2r-2d-2 



Oj(eXjp(lZjUl)) 



(4) 



where g(uj) = o- 2 (2n)~ 1 \8(cx-p(iuj))\ 2 \(f)(exp(iuj))\~ 2 and —n < w < n. 

If m = 2h, the spectral density is given by equation (4) with the summation ranging 
from —h +1 to h for —TC < lo < and from —h to h — 1 for < lo < n. 
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(b) As m —> oo, the normalized spectral density function of {V r V^ l 1 • • • V^ C X™} 
verges to f(,ii(u)) = K iiR f^ R (uj), where 



11 2 



2r+2 c 



><n 



n 

@j(exp(izja;)) 



-2D, 



(3) 



|w + 2fc7t|" 



2r-2d-2 



k— — c 



where K^ r is the normalization constant ensuring that J^ n f^^(oj)du} = 1. 

Remark 1. The assumption that Sj = mzt, for i = 1, . . . , c, and m — >• oo in Theorem 1(b) 
should be interpreted as follows: the periodicities s^s arc multiples of the aggregation 
size to, and the aggregation size is large. Consider two examples. Example (1): hourly 
data (that have a quarterly seasonality) are aggregated into monthly data, so s = 2160, 
m = 720, and z = 3, and Example (2): half-hourly data (that have a weekly seasonality) 
are aggregated into daily data, so s = 336, m = 48, z = 7. 

Remark 2. Note that z c > z c _i > • • • > z\ > 1. For j = 1, . . . ,c, k = 0, 1, . . . , [zj/2], let 

Ujk = Vj(mk) 



2nk j Zj , then both 



,-2r-2d-l 



/f,fl,m and f^M are of order 



0(|cj| _2d_2 ' Dl 2 c ), for u> ->• 0, and of order 0(|w - Wjfcl -2 ^), for u -> Wjfe, j = 
1, . . . ,c, fc = 1, . . . , [zj/2]. The above observations indicate that, if the periodicities Sj's 
are multiples of the aggregation size to, then the aggregates and their limits preserve 
the long-memory and seasonal long-memory parameters of the underlying SARFIMA 
process, whereas the Zj's become the periodicities of the aggregated series. 



Remark 3. If z\ = 1, the corresponding seasonal long- memory component is confounded 
with the regular long-memory component for the limiting aggregate process. Hence, with- 
out loss of generality, we shall set D\ = if Z\ = 1 in applications. 



Remark 4- If r = 0, then the limiting model of the aggregates of {Y t } is simply a 
SARFIMA (Pi, Ri +D\, Q\) Zl x ■••x (P c ,Rc + D c ,Q c ) Zc process with fractional Gaussian 
noise as the driving noise process, where the self-similarity parameter (Hurst parameter) 
of the underlying fractional Gaussian process equals H = d + 1/2. Sec Beran [1] for 
definition of the fractional Gaussian noise. 



4. Spectral maximum likelihood estimator and its 
large sample properties 

We are interested in applying the long-memory limiting aggregate process derived in 
Section 3 to data analysis. For this purpose, we assume (i) < d + D\ + ■ ■ ■ + D c < 1/2 
and (ii) < Dj < 1/2 for j = 1, . . . , c. The limiting aggregate process is of long memory 
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regularly or seasonally if either < d + D\ + • • • + D c < 1/2 or < Dj < 1/2 for some 
je{l,...,c}. We also introduce the parameter a to account for the variance of the data. 
Furthermore, we assume Zj, j = l,...,c, arc known. Consider a time series {Yi}^L 1 _ s , 
where S is a positive integer to be defined below, such that, conditional on {Yi}® =1 _ s , 
{VV^ 1 • • • V^fli}^ is a stationary process with its spectral density defined by 



f(oj;^R,a 2 )=a 2 f*(u;^,R), 



(6) 



where f*(uj;£,R) is define in (5), S = max r + J2i=i z i ' max i?i; max r and max^, i = 
1, . . . ,c, are the largest possible values of r and Ri, i = l,...,c, respectively, which we 
will consider in simulation studies and real data analysis in Sections 5 and 6. That is, 
the spectral maximum likelihood estimators f and Ri, i = l,...,c, satisfy the conditions 
that f £ {0, . . . , max r } and Ri £ {0, . . . , maxflj, for i = 1, . . . , c. 

It can be easily checked that, conditional on the joint distributions of 

{V r Vf> ■■■V? a c Yi}iLi and {Yi}f =l are the same. Therefore, conditional on {Y t }® =1 _ s , 
the (negative) log-likelihood function of {Yi} can be approximated by the (negative) 
Whittle log-likelihood function (see Hosoya [8]) 



f(£,i?,a 2 ) = ]T>g7K;£,i?,a 2 ) 



(7) 



where <jj := 2nj/N € (0,7t) are the Fourier frequencies, T is the largest integer < 
(TV - l)/2, J^i?) = | Ef=i C/,(i?)exp(i^)| 2 /(27tiV), and ^(iJ) = V r V^ • • • Vf;F l; 
i = 1, . . . , JV. In (7), the computation of f(oJj ; £, R, <r 2 ) requires evaluation of an infinite 
sum. Here, we adopt the method of Chambers [4] to approximate /(w; £, i?, <r 2 ) by 



f(u;ti,R,a 2 ) 



Sill 



2r+2 c 



n 



-2D,- 



n 



h(u;£,R), 



where h(u>;£, R) = {2n(2r + 2d+ l^i^nM - cj)- 2 ^ 2 ^ 1 + (2nM + w)- 2 ^ 2 ^ 1 } + 
Efc=-jw |w + 2fc7T| _2r_2d_2 for some large integer M. By routine analysis, it can be shown 
that, under the conditions stated in Theorem 2, the approximation error of h(oj;R,^) 
to the infinite sum is of order 0(M~ 2r ~ 2d ~ 2 ). Also, the approximation error of the 
first partial derivative with respect to d is of order 0(M~ 2r " 2d " 1-e ), for any positive e 
less than 1. These error rates guarantee that if the truncation parameter M increases 
with the sample size at a suitable rate, then the truncation has negligible effects on the 
asymptotic distribution of the estimator, see Theorem 2 below. Replacing f(ojj;^,R,a 2 ) 
by f(ujj;l;,R,a 2 ) and letting g{ujf,^,R) = f(ujj;^,R,a 2 )/a 2 , the (negative) Whittle log- 
likelihood function (7) now becomes 

- m R, a 2 ) = £ { log o 2 + logsK- ; R) + fefr^ 



(8) 



Seasonal long-memory time series 



7 



Differentiating (8) with respect to a 2 and equating to zero gives 

Substituting (9) into (8) yields the objective function 

- fa R) = ]T logff ( Uj ; £, i?) + Tlog ( £ + C ' ( 10 ) 

3=1 v 

where C = T — TlogT. The objective function is minimized with respect to £ and i? 
to get the spectral maximum likelihood estimators £ and i?; the estimator a 2 is then 
calculated by (9). Specifically, the spectral maximum likelihood estimators £ and R are 
computed based on equation (10) using the following procedure (Recall that < Dj < 
1/2. for j = l,...,c, and < d + D\ + ■ ■ ■ + D c < 1/2). For each f G {0, .. . ,max r } and 
R L G {0, . . . , maxflj, for i = 1, . . . , c, we first find the local maximum likelihood estimator 

of £ in the range that < Dj < 1/2, j = 1, . . . , c, and f<r + d + Z?H h D c < f + 1/2. 

In our experiments, we let max r = max^ = ••• = max# c = 2. These local maximum 
likelihood estimators are then used to find the global maximum likelihood estimator of 
£ and R = (r, Ri , . . . , R c ) . 

For simplicity, let 9 = (£,<t 2 ), 6 = (£,<r 2 ) and R be the spectral maximum likeli- 
hood estimator that minimizes the (negative) Whittle log-likelihood function (8). Be- 
low, we derive the large-sample distribution of the spectral maximum likelihood estima- 
tor. 

Theorem 2. Let the data Y = {Yi\f =l be such that {VV* •••Vf c c y i }^ 1 is sampled 
from a stationary Gaussian seasonal long-memory process with the spectral density given 
by (6). Let the spectral maximum likelihood estimator 9 € O, a compact parameter space, 
and the true parameter 9q be in the interior of the parameter space. Assume that each 
component of R= (r, R±, . . . ,R C ) is known to be between and some integer K. Let ro 
and do be the true values of r and d, and the truncation parameter M increase with the 
sample size so that M — > oo . Then the spectral maximum likelihood estimator R and 9 are 
consistent. Moreover, if y/~N M~ 2r °~ 2d °~ 1 — >• as iV — s- oo, then \/N(6 — 9o) converges 
in distribution to a normal random vector with mean and covariance matrix T(9q)^ 1 
with 

r(0) = ^f (vlog/(w;i?,#))(vlog/(w;i?,tf))'d w , (11) 

where V denotes the derivative operator with respect to 9, and superscript ' denotes 
transpose. 
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5. Empirical comparison between the limiting model 
and the SARFIMA model 



Given aggregation is finite in practice, fitting the limiting model (6) to aggregate data 
may result in bias, even though the bias vanishes with increasing aggregation. On the 
other hand, "to some extent, a discrete time series model is conditional on the time 
scale," as remarked by a referee. So, it is pertinent to compare the empirical perfor- 
mance of the long-memory parameter estimators based on the proposed limiting model 
with those based on the SARFIMA model fitted to the aggregate data. As aggregation 
carries a signature in the long- memory data structure as spelt out in Theorem 1, fitting 
a SARFIMA model to aggregate data may result in even larger bias on the long-memory 
parameters than the limiting model. Here, we report some simulation results for clari- 
fying the aforementioned issue. Consider the aggregated time series {Yi}f =1 _ s such that 
{VV^li}^! is a stationary process with its spectral density defined by 



f(u;r,d,D,a 2 ) = a< 





2r+2 










sin(|) 













-2D 



h-1 

£ 

k=-h 



(2m) sin 



2/ctt 



2m 



-2r-2d-2 



(12) 



.9 

\ TO 



where g(ui) = |(/)(exp(iw))| 2 , 4>{x) = 1 — fax, 5 — max r + maxflZ, and < ui < n. For 
— 7t < lj < 0, f(uj;r,d,D,a 2 ) = /( — cj; r, d, D, a 2 ). We consider a = 2, z = 10, and r = 
R = 0. The true values of (d,D) are (i) (-0.1,0.3) and (ii) (0.2,0.25), whereas those 
of the other parameters are given in Table 1. The sample sizes considered are N = 512 
and N = 1024. We tried a range of AR(1) coefficient fa: -0.9,-0.5,0.0,0.5, and 0.9. 
The aggregation size m are set to be 60, 240, and 720, corresponding to the cases that 
minutely data arc aggregated over one hour, four hours, and half a day, respectively. To 
each aggregated time series simulated from model (12), we fitted (i) the limiting aggregate 
model, and (ii) the SARFIMA model. The averages and the standard deviations, as well 
as the asymptotic standard errors, of 1000 replicates of the estimators for (i) (d, D) = 
(—0.1,0.3) and (ii) (d,D) — (0.2,0.25) are summarized in Tables 1 and 2, respectively. 
Note that the estimates of r and R equal zero for all simulations. 

From Tables 1 and 2, it can be seen that the bias of the estimator of d for the limiting 
aggregate model is generally smaller in magnitude than that of the SARFIMA model, 
except for (d,L>,0i,m) = (-0.1,0.3,0.9,60), and (d, D, fa, m) = (-0.1,0.3,0.9,240). For 
(d, D) = (0.2, 0.25), the bias of the estimator of d + D for the limiting aggregate model is 
always smaller in magnitude than that for the SARFIMA model. For (d, D) = (—0.1, 0.3), 
the bias of d + D for the limiting aggregate model is smaller in magnitude than that 
for the SARFIMA model if fa = -0.9 or {(N,m,fa)\N = 1024, m = 720, fa ^ 0.9}. For 
(d,D) = (—0.1,0.3), the bias of the estimator of D for the limiting aggregate model is 
always smaller than or equal to that for the SARFIMA model, in magnitude. For (d, D) = 
(0.2, 0.25), the bias of the estimator of D for the limiting aggregate model is always larger 



Table 1. Averages (standard deviations) of 1000 simulations of the spectral maximum likelihood estimators of the parameters 
d, and D by fitting the limiting aggregate model (6) and the SARFIMA model (2), respectively, to aggregate data generated 
according to (12), with (d,D) = (—0.1,0.3). The asymptotic standard errors for the estimators of (d,D,d + D) are (0.03,0.03,0.04) 
and (0.02, 0.02, 0.03) for N = 512 and N — 1024, respectively. Results under column heading "A" denote those from the limiting 
aggregate model (6), whereas those under "S" are the counterparts from the SARFIMA model (2) 

AT = 512 N = 1024 



m m = 60 m = 240 m = 720 m = 60 m = 240 m = 720 
True 



01 


meter 


value 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


-0.9 


d 


-0.1 


-0.163 


-0.210 


-0.126 


-0.163 


-0.113 


-0.147 


-0.162 


-0.206 


-0.125 


-0.160 


-0.112 


-0.144 








(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 




D 


0.3 


0.325 


0.329 


0.323 


0.325 


0.322 


0.324 


0.318 


0.322 


0.315 


0.318 


0.314 


0.317 








(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 




d + D 


0.2 


0.162 


0.119 


0.197 


0.162 


0.210 


0.177 


0.157 


0.117 


0.190 


0.158 


0.202 


0.173 








(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


-0.5 


d 


-0.1 


-0.112 


-0.146 


-0.105 


-0.138 


-0.103 


-0.135 


-0.111 


-0.143 


-0.105 


-0.135 


-0.103 


-0.133 








(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 




D 


0.3 


0.322 


0.324 


0.322 


0.323 


0.322 


0.323 


0.314 


0.316 


0.314 


0.316 


0.314 


0.316 








(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 




d + D 


0.2 


0.211 


0.178 


0.217 


0.186 


0.218 


0.188 


0.203 


0.174 


0.209 


0.180 


0.211 


0.183 








(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 





d 


-0.1 


-0.101 


-0.133 


-0.102 


-0.133 


-0.102 


-0.134 


-0.100 


-0.131 


-0.101 


-0.131 


-0.101 


-0.131 








(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 




D 


0.3 


0.322 


0.323 


0.322 


0.323 


0.322 


0.323 


0.314 


0.316 


0.314 


0.316 


0.314 


0.316 








(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 




d + D 


0.2 


0.221 


0.191 


0.220 


0.190 


0.220 


0.190 


0.213 


0.185 


0.212 


0.184 


0.212 


0.184 








(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 



Table 1. (Continued) 



AT = 512 iV = 1024 



Para- 


True 


m = 60 




m = 240 




m = 720 




m = 60 




m = 240 




m = 720 




























4>i meter 


value 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


0.5 d 


-0.1 


-0.092 


-0.121 


-0.099 


-0.130 


-0.101 


-0.132 


-0.091 


-0.119 


-0.098 


-0.128 


-0.100 


-0.130 






(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


D 


0.3 


0.321 


0.323 


0.322 


0.323 


0.322 


0.323 


0.313 


0.315 


0.313 


0.315 


0.313 


0.315 






(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


d + D 


0.2 


0.230 


0.201 


0.223 


0.193 


0.221 


0.191 


0.222 


0.195 


0.215 


0.187 


0.213 


0.186 






(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


0.9 d 


-0.1 


-0.040 


-0.060 


-0.085 


-0.113 


-0.095 


-0.126 


-0.041 


-0.059 


-0.085 


-0.111 


-0.095 


-0.124 






(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


D 


0.3 


0.320 


0.320 


0.321 


0.322 


0.321 


0.323 


0.311 


0.311 


0.313 


0.314 


0.313 


0.315 






(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.04) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


d + D 


0.2 


0.279 


0.260 


0.236 


0.209 


0.226 


0.197 


0.270 


0.252 


0.228 


0.203 


0.218 


0.191 






(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.05) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 



Table 2. Averages (standard deviations) of 1000 simulations of the spectral maximum likelihood estimators of the parameters 
d, and D by fitting the limiting aggregate model (6) and the SARFIMA model (2), respectively, to aggregate data generated 
according to (12), with (d,D) — (0.2,0.25). The asymptotic standard errors for the estimators of (d,D,d + D) are (0.03,0.03,0.04) 
and (0.02, 0.02, 0.03) for N = 512 and N — 1024, respectively. Results under column heading "A" denote those from the limiting 
aggregate model (6), whereas those under "S" are the counterparts from the SARFIMA model (2) 









AT CT 1 O 
I\ = 512 












N = 1024 












Para- 


True 


m = 60 




m = 240 




m = 720 




m = 60 




m = 240 




m = 720 




01 


meter 


value 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


-0.9 


d 


0.2 


0.186 


0.223 


0.197 


0.231 


0.198 


0.232 


0.189 


0.225 


0.198 


0.234 


0.200 


0.235 




D 


0.25 


(0.03) 
0.260 


(0.03) 
0.250 


(0.03) 
0.258 


(0.03) 
0.247 


(0.03) 
0.258 


(0.03) 
0.247 


(0.02) 
0.258 


(0.02) 
0.250 


(0.02) 
0.256 


(0.02) 
0.248 


(0.02) 
0.257 


(0.02) 
0.246 








(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 




d + D 


0.45 


0.446 


0.474 


0.455 


0.479 


0.456 


0.479 


0.446 


0.476 


0.454 


0.482 


0.457 


0.481 








(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


-0.5 


d 


0.2 


0.197 


0.230 


0.196 


0.231 


0.198 


0.232 


0.198 


0.234 


0.199 


0.235 


0.200 


0.236 








(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 




D 


0.25 


0.258 


0.247 


0.258 


0.247 


0.258 


0.248 


0.257 


0.248 


0.256 


0.246 


0.257 


0.246 








(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 




d + D 


0.45 


0.455 


0.478 


0.454 


0.479 


0.456 


0.480 


0.454 


0.481 


0.455 


0.482 


0.457 


0.483 








(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 





d 


0.2 


0.198 


0.232 


0.199 


0.231 


0.199 


0.231 


0.200 


0.236 


0.199 


0.235 


0.200 


0.236 








(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 




D 


0.25 


0.258 


0.247 


0.258 


0.246 


0.258 


0.246 


0.257 


0.246 


0.256 


0.247 


0.256 


0.246 




d + D 


0.45 


(0.04) 
0.456 


(0.03) 
0.479 


(0.04) 
0.457 


(0.03) 
0.477 


(0.04) 
0.456 


(0.03) 
0.478 


(0.03) 
0.457 


(0.02) 
0.483 


(0.03) 
0.456 


(0.02) 
0.482 


(0.03) 
0.456 


(0.02) 
0.482 








(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 



Table 2. (Continued) 



AT = 512 iV = 1024 



Para- 


True 


m = 60 




m = 240 




m = 720 




m = 60 




m = 240 




m = 720 




























4>i meter 


value 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


A 


S 


0.5 d 


0.2 


0.201 


0.233 


0.199 


0.233 


0.197 


0.231 


0.202 


0.238 


0.200 


0.236 


0.200 


0.235 






(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


D 


0.25 


0.257 


0.245 


0.258 


0.246 


0.258 


0.248 


0.256 


0.245 


0.256 


0.246 


0.256 


0.247 






(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


d + D 


0.45 


0.459 


0.478 


0.457 


0.479 


0.455 


0.479 


0.458 


0.483 


0.456 


0.482 


0.456 


0.483 






(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 


0.9 d 


0.2 


0.233 


0.263 


0.201 


0.237 


0.200 


0.233 


0.236 


0.265 


0.206 


0.241 


0.201 


0.237 






(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


D 


0.25 


0.246 


0.230 


0.254 


0.244 


0.258 


0.247 


0.247 


0.231 


0.256 


0.244 


0.256 


0.246 






(0.03) 


(0.03) 


(0.03) 


(0.03) 


(0.04) 


(0.03) 


(0.02) 


(0.02) 


(0.02) 


(0.02) 


(0.03) 


(0.02) 


d + D 


0.45 


0.479 


0.492 


0.455 


0.481 


0.458 


0.480 


0.489 


0.495 


0.462 


0.486 


0.457 


0.483 






(0.03) 


(0.01) 


(0.04) 


(0.03) 


(0.04) 


(0.03) 


(0.02) 


(0.01) 


(0.03) 


(0.02) 


(0.03) 


(0.02) 



Seasonal long-memory time series 



13 



than that for the SARFIMA model except for (<f> lt m) = (0.9,60) and (0i,m) = (0.9,240). 
Overall, these limited simulation results suggest that the limiting model leads to generally 
less biased estimates of d, and with comparable estimates of D, than the SARFIMA 
model. Possible gains of long-term forecast accuracy due to lesser bias in the estimator 
of d based on the limiting model will be further explored in the real application below. 



6. Application 

In this section, we report some analysis of a time series of counts of http requests to 
a World Wide Web server at the University of Saskatchewan, Canada, between 1 June 
and 31 December in year 1995, within the framework of the limiting aggregate sea- 
sonal long-memory model and spectral maximum likelihood estimation. The original 
data set consists of time stamps of 1-second resolution, which can be downloaded from 
http://ita.ee.lbl.gov/html/contrib/Sask-HTTP.html. Palma and Chan [15] ana- 
lyzed the 30-minute (non-overlapping) aggregates, that is, each data point represents 
the total number of requests sent to the Sakastchewan's server within a 30-minute in- 
terval. There are 9074 observations in total. To make the data more Gaussian and to 
stabilize their variances, Palma and Chan [15] applied a logarithmic transformation to 
the aggregate data. See Figure 1 for the time series plot, the sample autocorrelation 
function, and the periodogram of the transformed aggregate data. Their fitted model 
is a SARFIMA(l,d,l) x (0,L>,0) S model with (d,D,<p,§) = (0.076,0.148,0.917,0.583). 
Although this model explains roughly two thirds of the total variance of the data, the 
residuals display significant autocorrelations at several lags, in particular, at lags from 
40 to 50 (Figure 6(a) of Palma and Chan [15]), suggesting a lack of fit. Hsu and Tsai [9] 
also analyzed the same data set, pointing out the presence of both daily and weekly per- 
sistency in the data. Indeed, observe that there are two major peaks in the periodogram: 
one at the origin and another at frequency u> = 2rcx 189/9074 = 0.1309. These features 
indicate a possible seasonal long- memory process with z = 48, that is, a daily pattern. 
The third peak is at frequency uj = In x 27/9074= 0.0187, indicating a possible weekly 
pattern. 

Here, we reanalyze this dataset with the limiting aggregate seasonal long-memory 
model defined by (6) with c = 3, r = R\ = R2 = R3 = 0, z\ = 1, z 2 = 48 (corresponding 
to daily effects), and z 3 = 48 x 7 = 336 (corresponding to weekly effects). Note that 
m = 30 x 60 = 1800. Our new approach may be justified as the 30-minute aggregation 
may well fall within the intermediate asymptotic framework studied in Section 3. As 
discussed in Remark 3 of Section 3, we assume D\ = 0. Specifically, if {Y t } is the observed 
time series, the spectral density function of {Y t } can be written as 
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Figure 1. Time series plot, sample ACF, and periodgram of the Log transformed 30- minute 
(non-overlapping) aggregates, (a) Time series plot, (b) Sample ACF. (c) Periodogram. 



We have considered models of orders (Pi,Qi,P2,Q2,P3,Q3) = {Pi, Qi, 0,0, 0,0) with 
< Pi < 2, and < Q\ < 2. The model with the smallest AIC (Akaike information 
criterion) is (Pi, Qi, P2, Q2, P3, Q3) = (2,2,0,0,0,0). Goodness of fit of this model was 
studied in Chan and Tsai [5]. 

The spectral maximum likelihood estimates of the parameters and the 95% bootstrap 
confidence intervals based on steps 1-4 of Section 6 of Chan and Tsai [5] arc summarized 
in Table 3. The asymptotic standard deviations and the asymptotic 95% confidence 
intervals are also included in Table 3. It is clear that the bootstrap confidence intervals of 
the parameters are comparable to their asymptotic counterparts. The confidence intervals 
of the parameters d+ D% + P3, D2 and D3 indicate that the long-memory pattern, the 
daily seasonal long-memory pattern and the weekly seasonal long-memory pattern are 
all significant. 

To assess the advantage of using the proposed aggregation model in terms of forecast- 
ing, as compared to a SARFIMA model, we divide the data roughly into two halves, 
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Table 3. Spectral maximum likelihood estimates of the parameters of the model defined by 
equation (13), with (Pi, Q u P 2 , Q 2 , P 3 , Q 3 ) = (2, 2, 0, 0, 0, 0) 







Bootstrap 95% 


Asymptotic 


Asymptotic 95% 




Estimated 


confidence 


standard 


confidence 


Parameter 


value 


interval 


error 


interval 


(/ 


0.2326 


(0.1268, 0.2608) 


0.0436 


(0.1471, 0.3181) 


D 2 


0.1274 


(0.1085, 0.1429) 


0.0083 


(0.1111, 0.1437) 


D 3 


0.1271 


(0.1083, 0.1430) 


0.0083 


(0.1108, 0.1434) 


d + D 2 +D 3 


0.4871 


(0.3821, 0.5000) 


0.0441 


(0.4007, 0.5735) 


0i,i 


1.1277 


(0.8916, 1.5089) 


0.1256 


(0.8815, 1.3739) 


01,2 


-0.2610 


(-0.5773, -0.0508) 


0.1009 


(-0.4588, -0.0632) 


0i,i 


-1.1788 


(-1.4586, -0.9315) 


0.0936 


(-1.3623, -0.9953) 


#1,2 


0.3593 


(0.1237, 0.5755) 


0.0831 


(0.1964, 0.5222) 


a 


0.3117 


(0.3017, 0.3194) 


0.0051 


(0.3017, 0.3217) 



with the first 5074 data for fitting the proposed model and the SARFIMA model. We 
use the second half of data for comparing their forecasting performance by computing 
ft.-step ahead predictors, for h = 1, . . . , 4000, and the corresponding mean squared errors, 
via equations (5.2.19) and (5.2.20) of Brockwell and Davis [3], respectively. Specifi- 
cally, the competing model we consider is a SARFIMA(2, d, 2) x (0, D\, 0) S2 x (0, D 2 , 0) S3 
model, where s 2 = 48, and S3 = 48 x 7 = 336. The estimates of the parameters arc 
{d,D 2 ,D 3 J + D 2 + D 3 , 0\,i, 0i )2 , 0i,2, a) = (0.1996,0.1328,0.1280,0.4604,-0.038, 
0.8154,0.1099,-0.7342,0.1250). Note that the estimators of d and d + D 2 + D3 from 
the SARFIMA model are smaller than those from the proposed model, which is similar 
to some finding reported in Section 5. Figure 2 displays the ratios (in %) of the cumu- 
lative first h steps ahead mean absolute forecast errors of the SARFIMA model to their 
counterparts of the limiting aggregate model, for h = 1, 2, ... , 4000. These ratios measure 
the long-term forecast efficiency of the proposed model relative to the SARFIMA model. 
As can be seen from the figure, the proposed model produces more accurate long-term 
forecast than the SARFIMA model once the forecast horizon is approximately longer 
than 1 day. We have also examined the rates the h-step ahead prediction variances ap- 
proach their asymptotic value for the two models, and found that the fitted proposed 
model admits a slower convergence rate than the fitted SARFIMA model, which is con- 
sistent with the longer memory (at the zero frequency) estimated by the fitted proposed 
model than the SARFIMA model. 

7. Concluding remarks 

We have derived the limiting structure of the temporal aggregates of a (possibly non- 
stationary) SARFIMA model, with increasing aggregation. We have also obtained some 
asymptotic properties of the spectral maximum likelihood estimator of the limiting 
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Figure 2. Long-term forecast efficiency of the proposed model relative to the SARFIMA model. 



model, including consistency and asymptotic normality. Monte Carlo experiments show 
that the proposed method enjoys good empirical properties. Moreover, estimator of d 
under the proposed model appears to generally have smaller bias than that from fit- 
ting a SARFIMA model to aggregate data. The efficacy of our proposed methodology is 
illustrated with an analysis of an internet traffic data. Model diagnostic using a boot- 
strap procedure in the frequency domain, as presented in Chan and Tsai [5], suggests 
a good fit. Future research problems include extending the model to include covariates 
and developing other tools for model diagnostics. 
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